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applications of the Basic indexing and Retrieval System (BIRS). BIRS is a 
collection of Fortran IV computer programs designed to allow educational 
and psychological researchers to use their own locally based computer to 
construct and maintain the type of information retrieval system which best 
meets their individual needs* Given a collection of informational elements, 
punched on cards (e,g, abstracts of research documents, descriptions a* 
instructional materials, etc,), the system operates upon this collection to 
produce a variety of information services--induding the preparation of 
computer generated books with author/ sub ject indexes, and the development 
of automated reference searches for informational elements relevant to 
specific queries. The present monograph (1) provides sufficient informa* 
tion about BIRS to permit the reader to decide whether the system can be 
applied to his particular information retrieval problem; and (2) describes 
how the reader may obtain copies of the system and its technical documenta- 
tion. 
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I. INTRODUCTION 

The present document is the final report for a research and develop- 
ment project* jointly supported by the Office of Education, Department of 
Health, Education, and Welfare-— and the College of Education, Michigan 
State University* 

The tnaior stated oblective of the project was to develop and test a 
svs tern of Fortran programs for information retrieval which could be xd.de ly 
used by educators (for instructional materials) and by behavioral scien- 
tists (for research information)* 

This principal objective was accomplished bv the, release of the ini- 
tial version of the Basic. Indexing and Retrieval System (BXRS/I) in Decem- 
ber, 1966* BIRS/I satisfied all of the objectives previously stated in 
the grant proposal submitted to the Office of Education* 

BIRS/I is capable of aiding with the basic operations of storage, 
indexing, and retrieval either with coordinate or weighted indexing* As 
proposed, BIRS/I was completely tested on a collection of psychological 
test materials, prior to release. 

Once the primary objective of the project was satisfied by BIRS/I, 
we turned our attention to the attainment of objectives only hinted at by 
the original research proposal: (1) the development of BIRS/II, a more 

advanced system capable of handling a thesaurus of synonyms, permitting 

*USOE Contract #0E- 6- 16-04, Project #5-1144. Improving the Accessibility 
of Educational Materials: 1. Retrieval of Educational and Psychological. 

Tests. John F. Vinsonhaler, Principal Investigator. 
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greater freedom in stating questions, and allowing highly sophisticated 
methods of automated indexing; (2) the dissemination of BIRS/I and BIRS/II 
to computer installations interested in using the system; and (3) the appli- 
cation of BIRS/I and BIRS/II to significant educational information retrie- 
val problems. 



Surprisingly all three of these hoped-for objectives have been— to 
some extent-- attained since the accomplishment of the contractual obliga- 
tion by the completion and res lease of BIRS/I in December, 1966. (These 

additional achievements were certainly not obtained within the limited 

»» 

budget* of the present project. Substantial additional funds were con- 
tributed by the College of Education and by other federally funded projects, 
which required the use of BIRS.) Briefly, the following major develop- 
ments may be mentioned. 



First, the advanced version of the Basic Indexing and Retrieval 
System (BIRS/II) has been completed and tested on the CPC 3600 . BIRS/II 
provides all of the improvements indicated previously and is now being modi 
fied for the U.S.A.S.I. Standard Fortran, to permit easier implementation 
at diverse computer installations. Complete descriptions for BIRS/I and 
BIRS/II are given in subsequent sections of the present report. The sys- 
tem and full documentation will be released for national distribution in 
December, 1967. 



! 



★Federal contribution to the present project was about $15,000. This 
amount was doubled by other contributions* 
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Second, a dissemination service for BIRS has been established through 
Educational Publication Services, College of Education, Michigan State 
University, East Lansing, Michigan * The following table summarizes the maj 
sources of information about BIRS/I and BIRS/II* 

Presently, over 100 letters of inquiry regarding BIRS have been 
answered, about 10 orders for system documentation packages have been 
filled, and at least five major computer installations are committed to 
the modification of BIRS for local use. 

Third, the Basic Indexing and Retrieval System. (BIRS/I and BIRS/II) 
have been widely applied to diverse information problems ranging from the 
automation of research libraries to the content analysis of protocols 
obtained in psychological research studies. Particular applications of 
BIRS are summarized in final sections of the present report. 
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TABLE 1: Sources of Information About BIRS/I and BIRS/II 



I. REPORTS 

Koppitz, E. G. (Editor) "Applications of Computers*" Data Processing 
for Education . Vol. 6, No. 5. June, 1967 „ 

Summary of the BIRS/I System as presented in a report given at 
the AERA convention. 

Millman, J. (Editor) "Personalized Information Retrieval Systems." 
AERA Educational Researcher . 1967. Supplement Issue: Informa- 

tion Storage, Retrieval and Dissemination. 

Brief summary of the BIRS/I system with indication of usefulness 
of system for development of local, personalized IR systems. 

Vinsonhaler, J.F. "BIRS: A System of General Purpose Computer Pro- 

grams for Information Retrieval in Educational Research" Paper 
presented at the AERA Annual Convention, 1966. 

Analytic review of BIRS/I emphasizing applications to educational 
research and practice. 

Vinsonhaler, J.F. "BIRS: A System of General Purpose Computer Pro- 

grams for Information Retrieval in the Behavioral Sciences" 
American Behavioral Scientist . 1967, JLO, 12-23. 

Analytic review of BIRS/I with emphasis upon social science 
applications of the system. 

Vinsonhaler, J.F. "BIRS: A System of General Purpose Computer Pro- 

grams for Information Retrieval. Learning Systems Institute 
Paper No. 39 (Revised) . College of Education, Michigan State 
University. 

t 

i 

Definitive research report on the BIRS/I programs including 
comments on applicability in education and social science, analysis 
of major weaknesses, and announcement of plans for second version, 
BIRS/II, to correct faults. 

Vinsonhaler, J.F. "Generalized Systems of Computer Programs for 
Information Retrieval" Learning Systems Institute Paper #55 . 
College of Education, Michigan State University, 1967. (Pre- 
sented at the 75th Annual APA Convention, 1967), 

Review of general purpose programming for IR; Summary of BIRS/II 
and applications; Relation of the system to other general purpose 
IR programming developments. 
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TABLE I (continued) 



XI. TECHNICAL MANUALS 

Vinsonhaler, J.F. (Editor) Technical Manual for the Basic Index - 
ing and Retrieval System . Educational Publication Services, College 

of Education, Michigan State University, East Lansing, Michigan. 

Jan: 1967 (First Edition BIRS/X), and January, 1968 (Second 

Edition BIRS/II, in press). 

The technical manual for BIRS/I is now being distributed at 
cost. It includes chapters providing general reviews of IR, 
and the analysis of information systems, as well as a tech- 
nical chapter for operating each component program. 

The manual for BIRS/II will be ready for distribution in 
December, 1967. This edition will include more general 
reading in IR, special chapters for users and programmers as 
well as expanded chapters on component programs. 

III. SYSTEM PROGRAMS 

BIRS DOCUMENTATION PACKAGE 

Educational Publication Services, College of Education, Michigan 
State University, East Lansing, Michigan. 

Packages of the materials necessary to implement BIRS at 
computer installations are being distributed from Educational 
Publication Services at coSt. Each package includes: 

(1) Fortran Programs and standard test data punched on cards. 

(2) Printed output from compile and execution of all programs 
using test data. 

(3) Three copies of BIRS Technical Manual. 

Currently the package includes BIRS/I documentation* After 
December, 1967, the package will include only BIRS/II documen- 
tation. 
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As noted in the above table, complete technical documentation for the 
present project is available from Educational Publication Services at Michigan 
State University.* Therefore, the remaining portions of the current monograph 
will be devoted to a non- technical report for BIRS. The organization is as 
follows: 

Section II . indicates the research context for BIRS by summarizing 
the salient attributes of a representative set of similar systems in opera- 
tion or currently under development. 

Section III , provides a summary of the major characteristics of BIRS/I 
and BIRS/ II, including indications of the unique design underlying the 
system. 

Section IV . reviews several representative applications of the system 
to information retrieval problems in education. 

Finally, Section V provides some conclusions drawn from our experience 
in developing and applying the system. 

The goal of the present monograph is to pr ovide a basic source of infor- 
mation about BIRS . Thus, this document should (1) permit an individual 
scientist or educator to decide whether or not the system may be used to 
solve his particular information retrieval problem and (2) indicate how 
complete documentation for the system can be obtained. 

*The complete BIRS Documentation Package will be made available without 
cost to the U.S.O.E. for possible dissemination through the Educational 
Research Information Center (ERIC). 




II. COMPUTER PROGRAMS FOR INFORMATION RETRIEVAL 



Information systems are usually analyzed into three basic elements: 
the source (origin of the information) ; the receiver (user of the informa- 
tion) ; and the communication subsystem (the storage and retrieval mechan- 
ism permitting effective transmission between the receiver and the source). 
(Meadow, 1967) The present report is concerned with the automation of 
communication subsystems by means of IR programming systems « i.e., collec- 
tions of programs designed to cause computers to act as communication sub- 
systems for complete IR systems. 

Since the present analysis places a major emphasis upon Generalized 
IR Programming Systems, some comment regarding the concept of system gener- 
ality is necessary. Generality is an important characteristic of all 
problem oriented programming systems, whether designed for statistical 
analysis or information retrieval. Generality is typically defined in 
terms of the range of operations performed by a system. For our purposes, 
we shall define generality as a combination of adaptability and portability. 
Adaptability refers to the degree to which a system may be adapted to a 
variety of information processing problems. Portability refers to the ease 
xtfith which the system may be implemented at various computer installations. 
Portability reflects both the machine independence of a system and the 
adequacy of its documentation. 

Although the relative merits of generalized versus special purpose 
programming systems are frequently debated by computer scientists and pro- 
grammers, current applications programming is clearly emphasizing generalized 
systems. The most probable reason for this is the average scientist's need 
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for the easier access to the computer provided by generalized systems. 

A familiar example of this trend is the development and rapid acceptance 
by behavioral scientists of generalized systems for statistical analysis- 
such as the Cooley and Lohnes (1962) system, the Dixon (1964) BMD System, 
and the BCTRY System (Tryon and Bailey, 1966). 

The trend toward generalized systems is also clearly apparent in IR 
programming. The development of generalized IR programming systems appear 
to be a rather obvious extension of two major areas of prior and concurrent 
program development. One basis for generalized systems is provided by 
programs designed to sort and merge alphabetic and numeric information. 

The best known examples are the Key-Word-In-Context (KWIC) programs which 
produce printed indexes by permuting key words selected from author-title 
abstracts (Luhn, 1966). A second basis for generalized systems is provided 
by the specialized automated searching programs developed for particular 
applications— principally in the defense related industries. Currently, 
generalized IR programming systems formed by combining printed indexing 
and automated searching programs appear to be in a rapid stage of develop- 
ment. A recent federal survey of nonconventional scientific and technical 
information systems lists 118 computer systems (NSF, 1966b). Of these, 20 
produce only printed indexes; 17 produce only automated searches; but 81 
produce both printed indexes and automated reference searches. 

The following table summarizes several typical generalized IR pro- 
gramming systems currently available for use by educators and behavioral 

scientists. 
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TABLE 2 Representative IR Programming Systems 



SYSTEM SOURCE 



RESUME 



GIS, Generalized IBM (1965) 

Information System IBM Corp. 



INFOL Information Olle (1967) 

Language CDC (1966) 

Control Data 
Corp. 



General purpose file maintenance and 
searching system with full informa- 
tion algebra. Record rather than 
document oriented. Assembly 
Language, IBM 360 Series. 

General purpose file maintenance and 
searching system with limited infor- 
mation algebra. Record oriented. 
Assembly Language, CDC 3000 Series. 



GPFS, General 
Purpose FORTRAN 
System 



Bentley & Katz 
(1967) 

Sperry Rand 



Limited purpose file maintenance and 
searching system with limited 
information algebra. Record 
oriented. FORTRAN. Limited to 
CDC 3000 Series due to dependence 
on SCOPE monitor. 



General Inquirer 



Rand Catalog 



Stone (1962) Linguistic content analysis system. 

Harvard University Given a dictionary thesaurus and 

text, system generates word fre- 
quency statistics for terms 
encountered in text. Limited 
retrieval capability. 



Kay & Ziehe (1965) Very flexible general purpose file 
Rand Corp. maintenance system designed for 

linguistic analysis. Permits 
definition of "data structures” 
linking information for retrieval. 
Assembly Language. 



TRIAL, Technique 
for Retrieving 
Information from 
Abstracts of 
Literature 



BIRS, Basic 
Indexing and 
Retrieval 
System 



Janda (1966) General purpose document retrieval 

Northwestern system, without information algebra 

University or linguistic analysis. Designed to 

retrieve abstracts in response to 
queries submitted by users. 

Assembly Language, IBM 7000 Series. 



Vinsonhaler (1967) General purpose document retrieval 
Michigan State system without information algebra. 

University Designed to perform automatic 

content analysis, storage, and 
retrieval. FORTRAN IV for CDC 3000 
6Q00. Modifiable for IBM 7000 and 
360 Series. 



Several useful comments may be made regarding the contents of the 
table. First, there appear to be several maior lines of development . The 
generalized system approach is best exemplified by GIS. This system is 
designed to be maximally adaptable. The user may make use of a complete 
"information algebra" permitting remarkable flexibility in communicating 
with his information file. The difficulties involved in generating such 
completely adaptable systems are indicated by the fact that GIS remains 
incomplete at the present writing. However, less general systems, such as 
INFOL and GPFS, are currently available. The research system approach is 
illustrated by the General Inquirer and the Rand Catalog. Both systems 
permit a specialized type of analysis aimed at a particular field of re- 
search— in this case linguistic analysis. The document retrieval system 
line of development is illustrated by the last two systems summarized in 
the table. Both TRIAL and BIRS are designed to store and retrieve docu- 
ments in the behavioral sciences. Second, the design of current systems 
appears to emphasize adaptability over portability . Thus, the most adapt- 
able systems tend to be the least portable. For example, GIS, INFOL, and 
the Rand Catalog are all written in the assembly language of particular 
computer systems. Third, a significant collection of general purpose IR 
programming systems is currently available for use in the behavioral 
sciences. 



Scientists embarking upon research projects requiring the development 
of IR systems are well advised to examine the literature for relevant pro- 
gramming systems. In this regard, educational and behavioral science 
journals offer only limited indications of current IR programming. More 
complete summaries are available in general reviews by agencies of the 






federal government (Berul, 1964; Fossum, 1966; NSF, 1966a; NSF, 1966b) 
and in recently published books in the field of information systems 
(Strauss, et al. , 1966; Carter, et al« , 1967; Schecter, 1967; Meadou, 1967), 











III. BASIC INDEXING AND RETRIEVAL SYSTEM (BIRS) 

A. PURPOSE 

The Basic Indexing and Retrieval System is a set of fundamental programs 
designed to allow scholars and scientists to use their own locally based 
computer to construct and maintain a variety of IR Systems. Thus, BIRS 
may be viewed as a set of essential IR tools. The research worker may use 
these tools to construct the type of IR System which best meets his immedi- 
ate needs. 

In general, BIRS may be adapted to the development of any IR System 
having the following characteristics. First , each of the informational 
elements (e.g., documents, summaries of documents, references to documents, 
etc.) retrieved by the IR System must be punched on cards. Second , for 
purposes of indexing or retrieval, each of these informational elements 
must be describable by a collection of terms (e.g., a set of keywords 
indicating the topical content of the element, etc.). Within these rather 
broad limits, BIRS may be used to construct and maintain a variety of IR 
Systems ranging from computer generated books with author/ subject indexes, 
to computer based searching and retrieval systems with automated retrieval 
of informational elements in response to users' questions. 

The development of BIRS has particularly emphasized three major char- 
acteristics which have proved especially important for general purpose 
systems: portability, simplicity, and adaptability. Portability refers 

to the ease with which BIRS, or IR Systems based upon BIRS, may be trans- 
ferred from one computer installation to another. In general, such systems 



o 
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should be easily implemented at most university computer centers, since 
BIRS is written entirely in a FORTRAN IV level language, and uses the mini- 
mum storage configuration generally available in scientific computer centers: 
32,000 words of core storage and six tape drives, BIRS was developed and 
tested on the Control Data Corporation 3600 at Michigan State University, 
Implementations of the system at other installations are now in progress. 

Simplicity refers to the ease of using BIRS or IR Systems based upon 
BIRS, In order to be of maximal value in education and behavioral science, 
BIRS should be easily used by any researcher, regardless of his technical 
background in IR or computer science. Accordingly, the system is construc- 
ted to handle IR processing in the most straightforward manner with a mini- 
mum degree of control required of the user. In addition, a comprehensive 
Technical Manual (Vinsonhaler, 1967) has been developed for the system 
which provides a background reading in IR, and thorough descriptions of all 
programs • 

Adaptability refers to the degree to which individual scholars may 
adapt the system to their own purposes. This characteristic is of particu- 
lar importance in educational and behavioral science IR, because of the 
diversity of informational needs and the necessity for experimentation with 
various types of IR methods within these fields. Adaptability implies both 
ease of modification and flexibility of use. Ease of modification is design- 
ed into the sytem by using so-called ”open-ended, modular” construction and 
by organizing the programming into independent clusters of dependent opera- 
tions, Thus, each program in BIRS performs a separate basic operation and 
may be modified or replaced without major changes in the remaining compon- 
ents of the system. Flexibility is built into the system by (1) completely 
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automating only those clerical operations common to most IR systems; (2) al- 
lowing the user to control the operations which are nonclerical and depen- 
dent upon his particular IR needs; and (3) providing the user with a choice 
among programs and among methods within a single program. 

B. THEORY 

Most currently functioning IR programming systems are designed to imple- 
ment rather specific types of IR Systems. Hence, such systems are developed 
ad hoc with minimal theoretical foundations. In contrast, BIRS is designed 
to implement many types of IR Systems. Consequently, an attempt has been 
made to establish an explicit theoretical basis for the methods used in 
developing the system. 

The general design of BIRS is based upon an extension of Vickery* s 
(1965) theoretical analysis of IR Systems. Essentially, this theoretical 
model is an abstraction of the functioning of most traditional libraries, 
and provides a basis for systems like BIRS, which permit users to develop 
automated or partially automated IR Systems, which are firmly based upon 
familiar traditional- library methods. 

The fundamental theoretical model for BIRS is summarized in Table 3 
and is discussed at length in the Technical Manual for the system. A 
complete discussion of analogous IR System models is available (Fossum, 

1966) . 

The BIRS theoretical model assumes three types of fundamental informa- 
tional elements: abstracts, descriptions , and questions. Three funda- 



mental operations are associated with these elements: in formation . stora g e , 
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indexing , and retrieval. These basic operations may be clarified by anal- 
ogy with traditional library operations. Information storage corresponds 
to the maintenance of the collection of books and periodicals in the tradi- 
tional library, i.e., the storage of documents for retrieval by access label 
or "call number." Information indexing corresponds to the maintenance of 
an index to the contents of the library, i.e., the maintenance of author, 
title, or subject catalogs. Finally, information retrieval operations are 
analogous to the process of searching author-title, or subject catalogs for 
the "call numbers" of relevant documents, and then retrieving the documents, 
themselves, by means of their "call numbers." 
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TABLE 3: 
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Fundamental Operations Performed by IR Systems 


OPERATION 


DESCRIPTION 


INFORMATION 

STORAGE 


These operations are concerned with the preparation and 
storage of the basic informational elements to be processed 
by the system. 


ABSTRACTING 


Abstracts (a) are defined as basic informational elements, 
e.g., summaries of documents. This process consists of the 
preparation of abstracts for input to the IR System. 


INFORMATION 

FILE 

MAINTENANCE 


Information files are defined as collections of abstracts. 
The contents of the file are sets of the form <L a ,a>, where 
a denotes any abstract and L a denotes a unique label or 
access number for the abstract. This operation consists of 
preparing or updating the file, so that abstracts may be 
located by means of the unique access labels. 


INFORMATION 

INDEXING 


These operations are concerned with the preparation and 
storage of descriptions of the topical contents of abstracts. 


ANALYSIS 


Descriptions (d a ) are defined as topical descriptions of 
abstracts, which are to be used to locate abstracts by 
subject content, e.g., sets of key-words. Analysis is 
concerned with the preparation of descriptions for each 
abstract in the Information File. 


DESCRIPTION 

(INDEX) 

FILE 

MAINTENANCE 


Description files are defined as indexes to information 
files, i.e., collections of descriptions and labels of the 
form <d a ,L a >. These files provide a link between topic 
descriptions (d a ) and abstracts (a) by means of the unique 
access labels. The operations consist of preparing or 
updating the contents of the index. 


INFORMATION 

RETRIEVAL 


These operations are concerned with the retrieval of 
abstracts stored in the information file <L a ,a>, by means 
of descriptions stored in the index or description file 
<d a ,L a >, in response to questions (q) submitted by users. 


QUESTIONING 


Questions are defined as requests for abstracts by topic. 

A question (c^) is a description of a class of abstracts, 
e.g., a set of key terms defining the type of information 
of interest. Questioning refers to the process of stating 
such search requests. 


SEARCHING 


Searching refers to the process of using the description 
file to locate relevant abstracts. Thus, searching includes 
reading questions (q) , comparing questions and descriptions 
(q,d a ), and storing* - the locations (q,L a ) of relevant 
abstracts. 


RETRIEVING 


Retrieving refers to the process of reading relevant 
abstracts from the information file, i.e., using the 
results of searching (q,L a ) to output relevant abstracts, 
(q*a). 
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Essentially, BIRS is an open-ended collection of independent computer 
programs organized according to this general theoretical model. Thus, the 
system includes a separate set of computer programs for each of the three 
fundamental operations of information storage, indexing, and retrieval. 
Each component program is designed to aid the user with a single funda- 
mental operation. To construct a particular type of IR System, the user 
simply selects the proper component programs. The main advantage of this 
"modular" design is flexibility. Since the fundamental operations are 
mutually independent, the component programs may be used or modified inde- 
pendently, Hence, different components may be combined in various ways to 
produce a variety of IR Systems* Similarly, the entire system may be 
selectively modified by replacing existing components (or adding new ones) 
for one fundamental operation— without any modification of components de- 
signed for other basic operations. 

The fundamental operations performed by BIRS are summarized in Table 4, 
As indicated in the table, the preparation of abstracts and questions must 
be performed by the user. All other fundamental operations may be per- 
formed by the user, performed by the user aided by BIRS, or performed en- 
tirely by BIRS, under the control of the user. 
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TABLE 4: Fundamental IR Operations Performed by BIRS 



OPERATION 


PERFORMED 
BY USER 


AIDED 
BY BIRS 


PERFORMED 
BY BIRS 


Information Storage 


Abstracting 


Yes 


No 


No 


Information File Maintenance 


Yes 


Yes 


Yes 


Information Indexing 


Analysis 


Yes 


Yes 


Yes 


Description File Maintenance 


Yes 


Yes 


Yes 


Information Retrieval 


Questioning 


Yes 


No 


V~ 


Searching 


Yes 


Yes 


Yes 


Retrieving 


Yes 


Yes 


Yes 












C. BIRS/I 

The f t, 1 ? ial version of the Basic Indexing and Retrieval System (BIRS/I) 
is now fully operative on the CDC 3600 at Michigan State University, Essen- 
tially, BIRS/I is designed to generate IR Systems using the so-called 
method of ’’coordinate index’’--descriptions of abstracts are limited to un- 
ordered sets of key descriptive terms. 

The computer programs included in BIRS are each designed to perform 
one of the three basic operations of IR Systems described in the previous 
section. Table 5 summarizes the functions performed by the seven programs 
included in the initial system. Briefly, BIRS/I includes a minimal collec- 
tion of component programs covering the three major operations. Thus, one 
program (1FMP) is provided to aid the user with information storage; three 
programs (DAP, DFMP, PIP) are provided for information indexing; and two 
(DFSP, IFRP) are provided for information retrieval. Finally, one addi- 
tional program (EXEC) is provided to aid with the manipulation of the 
system as a whole, e,g,, to call components into operation from the system 
library tape. 

In constructing any particular IR System with BIRS, the researcher may 
utilize various types of cards, tapes, and printed reports. The cards, 
tapes, and reports used or generated with the system may be classified 
according to the fundamental IR System operation which they serve. 
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TABLE 5: Component Programs of the Basic Indexing and Retrieval System (BIRS/I) 



OPERATION 


PROGRAM 


FUNCTIONS 


Executive 


EXEC 


The Executive Program is designed to store and retrieve 
component programs comprising BIRS. 


Information 

Storage 


IFMP 


The Information File Maintenance Program is designed to 
read informational elements from cards, assign a unique 
access number to each element, and store the element on 
the Information File Tape (IFT) , so that the element 
may be retrieved, given its access number. 


Information 

Indexing 

(Descriptive 

Analysis) 


DAP 


The Descriptive Analysis Program is designed to aid the 
user with the task of indexing or classifying informa- 
tional elements. In general, DAP reads informational 
elements either from the IFT or from cards and searches 
them for key words. Thus, DAP may be used either to 
perform a word frequency analysis or to automatically 
index informational elements. 


Information 

Indexing 

(Description 

File 

Maintenance) 


DFMP 


The Description File Maintenance Program is designed to 
read descriptions (i.e., sets of indexing terms) and 
access numbers of informational elements from cards and 
store them on the Description File Tape (DFT) to provide 
an index to the contents of the Information File Tape 
(IFT). The user may manually generate the card input « 
for the DFMP, or use either the IFMP or the DAP, to aid 
him with this task. 


Information 

Indexing 

(Printed 

Indexing) 


PIP 


The Printed Indexing Program is designed to prepare a 
traditional author or subject index using informational 
elements read from cards or from the IFT, or using 
descriptions read from cards or from the DFT. The IFMP 
may be used in conjunction with this program to generate 
a listing of informational elements organized by access 
number . 


Information 

Retrieval 

(Automated 

Searching) 


DFSP 


The Description File Searching Program is designed to 
read requests for particular types of informational 
elements (stated as sets of key terms) from cards, to 
search the DFT for relevant informational elements, and 
to store the access numbers of the most relevant 
elements on the Question File Tape (OFT) . 


Information 

Retrieval 

(Automated 

Retrieval) 


IFRP 


The Information File Retrieval Program is designed to 
read requests and access numbers of the relevant 
informational elements from the QFT, retrieve the 
corresponding informational elements from the IFT, and 
print both the requests and the relevant informational 
elements for the user. 
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Table 6 summarizes the main card input for IR Systems developed with 
BIRS/I, As shown, abstracts, descriptions of abstracts, and questions are 
all input on cards* The abstract and question cards must be prepared by 
the user. The descriptor cards may be prepared by the user or by BIRS pro- 
grams* 

It should be noted that not all of these types of cards are needed for 
all applications* 

Table 7 summarizes the tapes which may be used with IR Systems developed 
by BIRS/I programs. As with component programs and cards, each tape is 
concerned with one of the major IR operations: the IFT contains abstracts, 

the DFT contains descriptions of abstracts, and the QFT contains questions 
and the access numbers of abstracts relevant to the questions* 

As before, not all IR Systems will require the use of all tapes. In 
general, only completely automated systems require all tapes* Partially 
automated systems replace tapes with printed reports* 

Table 8 summarizes the printed reports used with IR Systems developed 
from BIRS/I programs. As indicated, each type of report is mainly concerned 
with one of the three major operations. 

As before, different types of IR Systems will require the use of differ- 
ent types of printed reports. Thus, systems emphasizing manual searching 
and retrieval will require the use of a printed information file and a 
printed index, while automated searching and retrieval systems will require 
printed reports containing abstracts relevant to specific questions* 
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To summarize the preceding discussion, BIRS/I is essentially a general 
purpose indexing and retrieval system* It is designed to permit the con* 
struction of IR Systems which locate summaries of more complete documents 
stored in auxiliary files* Thus, IR Systems developed with BIRS are 
analogous to the reference services offered by traditional libraries* Thus, 
BIRS/I programs may be used to produce a printed equivalent of the author* 
title, subject index or an automated approximation to the reference- 

librarian. 
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TABLE 6: Cards Used with Component Programs of BIRS/I 



OPERATION 



CARD TYPE 



FUNCTION 



Executive BIRS 
and Control Control 



Cards 



Control cards are used to select component programs 
and govern their operations. Each card is punched 
with a *$ in the first two columns, followed by a 
control phrase. For example, cards calling the IFMP 
would contain *$IFMP. 



Information 
Storage and 
Indexing 



Abstract 

Cards 



Abstract cards are used to input informational 
elements to be processed by the system. An abstract 
may consist of any set: of information punched on 
from one to fifty cards. Each abstract is identified 
by a unique "access number" or "label , normally 
assigned by BIRS programs under the user’s control. 

The following is an example of a very simple abstract 
punched on cards. „ TA ™.. 

*$ABSTRACT LAi3t,L 

BENNETT, G.K. , BENNETT, M„G. , WALLACE, W.L., AND 
WESMAN, A.G. COLLEGE QUALIFICATION TEST. PSYCHOLOGICAL 
CORP. GENERAL INTELLIGENCE FOR COLLEGE ENTRANTS AND 
ADULTS. SIXTH MENTAL MEASUREMENTS YEARBOOK, O.K. 

BUROS, GRYPHON PRESS 1965, ENTRY NO. 450. TEST 
LIBRARY NO. C1033. 



Information 
Storage and 
Indexing 



Descriptor 

Cards 



Descriptor cards are used to input topical descrip- 
tions of abstracts for further analysis. Descrip- 
tions are sets of key terms describing the topical 
content of abstracts. Each abstract is separately 
described by a set of up to 30 terms, identified by 
means of the "label" or access number of the abstract 
being described. Terms are separated by commas, and 
the latter portions of long terms are ignored. There 
are no other restrictions on the content of terms. 
Descriptor cards may be generated by the user or by 
BIR.S/I programs. The following is an example of a 
description of the abstract given above. ^ n 

*$DESCRIPT0R LABEL 

COLLEGE, APTITUDE, VERBAL, QUANTITATIVE, ADULTS 



Information Question 
Retrieval Cards 



Question cards are used to query automated IR 
Systems developed with BIRS/I, i.e., to request that 
component programs search for and retrieve abstracts 
relevant to given topics. Question cards are used 
to indicate the topics of interest. Each search 
request is indicated by means of a set of up to 30 
terms, separated by commas. The following is an 
example of a request for abstracts dealing with 
general intelligence for adults. 

*$QUESTI0N 

GENERAL, INTELLIGENCE, ADULTS 
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TABLE 7 : Tapes Used with Component Programs of BIRS/I 



OPERATION 


TAPE 


CONTENTS AND FUNCTION 


Executive 


BIRS 

Library 


The BIRS Library Tape contains component 
programs of the system* The programs are read 
from cards and stored on the tape by the 
Executive Program. Programs may be called from 
the tape by means of executive control cards. 


Information 
Storage and 
Retrieval 


IFT 


The Information File Tape contains abstracts 
read from cards and stored so that each can be 
located given its unique access number or 
"label." The IFT is generated by the IFMP and 
is used by the DAP , the PIP , and the IFRP . • 


Information 
Indexing and 
Searching 


DFT 


The Description File Tape contains key-term 
descriptions and access- numbers or "labels" 
of abstracts. Descriptions and "labels" are 
associated so that abstracts may be located 
by topical content. The DFT is generated by 
the DFMP from descriptor cards prepared by 
the user, or by the DAP. The tape is used.br 
the PIP, and the DFSP. 




QFT 


The Question File Tape contains questions 
read from question cards, and access numbers 
of relevant abstracts. The QFT is generated 
by the DFSP and is used by the IFRP. 
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TABLE 8: Printed Reports Used with Component Programs of BIRS/I 



OPERATION 


REPORT TYPE 


CONTENTS AND FUNCTION 


INFORMATION 

STORAGE 


Printed 

Information 

File 


This report consists of a printed 
equivalent of the Information File Tape, 
i.e., a listing of abstracts printed in 
numerical order of access number. The 
printed Information File is generated by 
the IFMP from abstract cards or from the 
IFT. 


INFORMATION 

INDEXING 


Content 

Analysis 

Report 


This report consists of a content analysis 
of word usage in abstracts, read either 
from cards or from the IFT. The report is 
generated by the DAP, which also may be 
used to automatically index each abstract 
by selecting key terms and punching them 
on descriptor cards. 




Printed 

Index 


This report consists of a printed 
equivalent of the Description File Tape, 
i.e., an index for locating abstracts by 
topic. Since this index is designed for 
manual searching, a term-entry index is 
prepared. That is, all key-terms are 
printed in alphabetical order and each 
term is followed by the "labels" or access 
numbers of abstracts relevant to the term. 
The printed index is produced by PIP from 
abstract cards, the IFT, descriptor cards, 
or the DFT. 


INFORMATION 

RETRIEVAL 


Abstracts 
relevant to 
questions 


This report consists of the results of an 
automated search performed by the DFSP. 
Each question originally stated on cards 
is printed and following it the relevant 
abstracts— in order of relevance. The 
report is generated by IFRP from the QFT 
and IFT. 
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D. BIRS/II 

The second version of the Basic Indexing and Retrieval System (BIRS/II) 
is to be released for national distribution (with a complete Technical 
Manual) in December, 1967. BIRS/II is identical to BIRS/I except for the 
following ex tent ions: (1) Coordinate and weighted i ndexing— terms or terms 

and numerical weights may be used to describe documents; (2) Word root 
analyses and svnonvm dictionaries --terms may be reduced to generic roots, 
and synonyms may be proved during indexing and/or searching; and (3) Ge n er » 
alized logical searching- user may call upon simplified relevance searching 
or sentential, logic searches for question ansxfering. Table 9 summarizes 
the functions of the ten computer programs comprising BIRS/II. 



The cards, tapes, and printed reports used with the component programs 
of BIRS/II are identical to those used with the initial system except for 
the addition of facilities permiting weighted indexing, word root analysis, 
synonym dictionaries, and logical searching. Table 10 summarizes the major 
types of additional card input available in BIRS/II. 
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TABLE 9; Cuuipanent Programs of the Basic Indexing and Retrieval System 



( FUNDAMENTAL COMPONENT 

| IR OPERATION PROGRAM FUNCTIONS OF COMPONENT PROGRAM 



Executive EXEC The Executive Program is designed to store and retrieve 

component programs comprising BIRS. 



Information 

Storage 


IFMP 


The Information File Maintenance Program is designed to 
read informational elements*from cards, assign a unique 
access number to each element, and store the element on 
the Information File Tape (IFT) , so that the element 
may be retrieved, given its access number. 


Information 

Indexing 

(Descriptive 

Analysis) 


DAP 


The Descriptive Analvsis Program is designed to aid the 
user with the task of indexing or classifying informa- 
tional elements. In general, DAP reads informational 
elements either from the IFT or from cards and searches 
them for key words. Thus, DAP may be used either to 
perform a word frequency analysis or to automatically 
index informational elements 




FAP 


The File Analvsis Program is a highly generalized content 
analysis program which operates on natural language text, 
p^p performs the same operations as DAP in the indexing 

process. 


Information 

Indexing 

(Description 

File 

Maintenance) 


DFMP 


The Descriptive File Maintenance Program is designed to 
read descriptions (i.e., sets of indexing terms) and 
access numbers of informational elements from cards and 
etnrp them on the Description File Tape (DFT) to provide 
an index to the contents of the Information File Tape 
(IFT) • The user may manually generate the card input 
for the DFMP, or use either the IFMP or the DAP, to aid 
him with this task. 


Information 

Indexing 

(Printed 

Indexing) 


PIP 


The Printed Indexing Program is designed to prepare a 
traditional author or subject index using informational 
elements read from cards or from the IFT, or using 
descriptions read from cards or from the DFT. The IFMP 
may be used in conjunction with this program to generate 
a listing of informational elements organized by access 
number. 




PLP 


The Printed Listing Program is designed to provide printed 
books, i.e., listings of abstracts, ordered by the 
contents of the abstracts, i.e., principal authors last 
name, etc. The printed books produced by PLP are similar 
to those produced by IFMP, except that the books of the 
latter are ordered by IFT access number. 



♦Documents or abstracts of documents not exceeding 50 cards in length 
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TABLE 9: (continued) 



FUNDAMENTAL 
IR OPERATION 


COMPONENT 

PROGRAM 


functions of component program 


Information 

Retrieval 

(Automated 

Searching) 


DFSP 


Tfco n*cr 1 -Hption File Searching Program is designed to read 
requests for particular types of informational elements 
(stated as sets of key terms) from cards, to search the 
DFT for relevant informational elements, and to store 
the access numbers of the most relevant elements on the 
Ouestion File Tape (QFT). 




DFSPL 


Th» nesM-iotion File Loeical Searching Program is similar 
to the DFSP except that it is designed to perform logical 

searches . 


Information 

Retrieval 

(Automated 

Retrieval) 


IFRP 


The Information File Retrieval Program is designed to 
read requests and access numbers of the relevant 
informational elements from the QFT, retrieve the 
corresponding informational elements from the IFT; and 
print both the requests and the relevant informational 

elements for the user* 
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TABLE 10: Additional Cards Used With Component Programs of BIRS/1I 



CARD TYPE 



FUNCTION 



Synonym Cards 



Descriptor Card,® 



Question Cards 



Inputs the thesaurus or synonym dictionary 
to be used in indexing or searching. The 
following example equates "ability" and 
"aptitude" with "intelligence." 

*$ SYNONYMS 

INTELLIGENCE = ABILITY, APTITUDE 



A new form of the descriptor card - 

the use of numerical weights to indicate the 

degree of importance for indexing terms. 

The following example indicates that 754 
of the items in the described test measure 
verbal ability, while only 257. of the items 
measure Quantitative ability. 



*$ DESCRIPTION 

VERBAL - .75, QUANTITATIVE * .25, 
INTELLIGENCE » 1.00 



The question cards used with BIRS/II may 
optimally include relevance weights or 
logical operators to further control the 
automated search. The first example indi- 
cates a relevance search with an emphasis 
upon tests measuring verbal rather than 
quantitative ability. The second example 
Illustrates a typical logical search ex- 
cluding quantitative ability tests. 
*$QUESTI0N 
INTELLIGENCE * 1.00, 

QUANTITATIVE = .25, 

VERBAL = .75 

*$QUESTI0N , m 

(INTELLIGENCE OR APTITUDE) AND (NOT 
QUANTITATIVE)-. 









IV. APPLICATIONS 

The Basic Indexing and Retrieval System (BIRS/I and BIRS/II) has been 
very well received by educators and behavioral scientists. 

The various reports regarding the system have generated a substantial 
correspondence which includes requests for further information, orders for 
complete documentation, and numerous pledges of cooperation in implementing 
the system at diverse computer installations. 

We have no way of knowing all of the applications currently being made 
with BIRS. The major applications familiar to the present author are 
summarized in Table 11. These applications are at best representative; they 
do not even include all of the BIRS applications at Michigan State University 
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TABLE 11. Representative Applications of BIRS/I and BIRS/II 



PROJECT 


BIRS APPLICATION 


Instructional Materials Center for 
Handicapped Children and Youth; 
MSU/USOE. Alonso and Ward, 
Principal Investigators 


Automate Regional Library for mater- 
ials center* Computer generated 
library catalogs, reference search- 
ing etc* on CDC 3600* 


Instructional Materials Center* 
University of Texas/U30E:HCY. 

W* Wolfe, Principal Investigator* 


Computer generated indexes for 
materials center collection, on 
CDC 6600* 


Single Concept Film Clip Project* 
MSU/USOE. Prof. Woodrow Miller, 
Principal Investigator* 


Computer generated indexed book for 
850 single concept films, on CDC 
3600. 


ERIC Council on Exceptional 
Children Clearinghouse* 

Dr* June Jordan, Principal 
Investigator* 


Preparation of accession lists for 
15 clearinghouses and searches for 
duplication of abstracting among 
clearinghouses, on CDC 3600* 


TEST Information Retrieval System 
Project. Vinsonhaler, Principal 
Investigator* 


Automated indexing and retrieval 
for psychological tests relevant 
to aptitude measurement* 


Michigan Inter-University Committee 
on Information Systems, (MICIS); 
jointly supported by University of 
Michigan, Michigan State University, 
Wayne State University, and the 
State of Michigan* 


Automated preparation of a continu- 
ally updated bibliography for com- 
puter applications in university 
education, including IR, CAI, and 
general applications of computers 
in university environments* 




laiiaaaiaiasaaai^^ t MmammiM 










32 

To clarify the nature of current BIRS applications, let us consider 
a particular example chosen from Table 11* The Ins tructional Materials 
Center for Handicapped Children and Youth is a regional library for docu- 
ments relevant to the education of visually handicapped children* Presently, 
the library includes about 6,000 major documents, e*g*, psychological tests, 
text books, periodicals, instructional devices, etc* All of the indexing 
and retrieval functions of the library are performed by BIRS, implemented 
on the university CDC 3600 System* 



The procedures are as follows* As new documents are acquired, they are 
assigned an arbitrary accession number and stored in the library. Next, 
an abstract is prepared for each document which includes bibliographic 
information, a summary, and the document accession number. The abstract 
is punched on cards for input to BIRS* 

Like most computer based libraries , the Materials Center provides users 
with both printed indexing services and automated reference searching. 

Thus, printed indexed books are provided in place of the traditional card 
catalogs while automated searching is provided in place of traditional 
reference service. The Materials Center staff uses BIRS mainly to prepare 
printed catalogs and to perform reference searches requested by users of 
the library* The procedures are summarized in the following figure* 
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Normally, the processing begins with the storage of new abstracts 
on an information file tape (step 1 in the figure). Next, each abstract 
is indexed or classified and description cards are prepared which contain 
the indexing terms. Usually, the indexing is automated (as shown in step 2 
of the figure). Finally, the description cards are input, and an index is 
constructed on a description file tape (step 3). Printed catalogs (step 4) 
are generated by programs which read abstracts and descriptions from the 
tapes. These programs then prepare books of abstracts and various types of 
key word indexes, e.g. , by author, title, subject, etc. Reference searches 
(step 5) are obtained by inputting question cards describing topics of 
interest to one of the BIRS searching programs which use the description 
fll e tape to locate relevant abstracts. The BIRS retrieval program reads 
the relevant abstracts from the information file tape and prints them for 
the user. 

Obviously, IR programming systems may be used for many purposes be- 
sides automating research libraries. For example, BIRS is currently being 
applied to produce reading lists for university courses, to prepare bibli- 
ographies for computer applications in education, to investigate methods of 
automated indexing and searching, and to perform content analyses of proto- 
cols for research in psychology. However, it must be admitted that the 
predominant usage of BIRS (and other similar systems) is in the automation 
cf research libraries. 
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CONCLUSIONS 



Several obvious conclusions may be offered with respect to the BIRS 
project. First , there appears to be a very real need for generalized IR 
programming systems in behavioral science and education. The success of 
BIRS probably stems from its provision for gradual steps beyond traditional 
library technology by the average university scientist or scholar. 

Second , our experience indicates that computerized IR Systems are 
substantially less expensive than manual systems for specialized libraries. 
For the 6000 document Instructional Materials Center collection. Computer 
Generated Catalogs (with author/ subject/ indexes) cost about $120 per 
original copy, while automated reference searches cost about $ .60 per 
search. One reason for this low cost (in contrast to that of other com- 
puterized IR Systems) stems from the direct participation of non-technical 
personnel in applying BIRS. 

The cost equations developed on the CDC 3600 (and probably applicable 
to other implementations) are as follows: 

$ Cost for KWIC Index ■ .002 (Number of Abstracts) X (Average Number 

of Cards Per Abstract) 

$ Cost for Each Ques- 
tion - .0001 (Number of Abstracts) 

The second factor in the first equation is the overall number of cards 
per abstract which are actually used in preparing the index. 

Third , perhaps the most significant finding of the present project is 
that technology rather easily outdistances understanding. Thus, we have 
found that many of the extentions of BIRS/I, included in BIRS/XI, are not 
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applied by users , because they lack a sound theoretical basis for such 
applications. Most of our users view elementary content analyses (e.g., 
word root analyses or synonym definition) with exaggerated suspicion. 

Further, the practical application of certain facilities**-such as 
probabilistic or weighted indexing (Maron & Kuhns, 1960)--are as much a 
puzzle to us as to our users. In short, we have come to the conclusion 
that further major developments in the technique of IR Programming Systems 
must necessarily be accompanied by careful study of the psychological or 
behavioral aspects of Information Retrieval and Communication. 
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